97 research outputs found
Layer-Wise Relevance Propagation for Explaining Deep Neural Network Decisions in MRI-Based Alzheimer's Disease Classification
Deep neural networks have led to state-of-the-art results in many medical imaging tasks including Alzheimerâs disease (AD) detection based on structural magnetic resonance imaging (MRI) data. However, the network decisions are often perceived as being highly non-transparent, making it difficult to apply these algorithms in clinical routine. In this study, we propose using layer-wise relevance propagation (LRP) to visualize convolutional neural network decisions for AD based on MRI data. Similarly to other visualization methods, LRP produces a heatmap in the input space indicating the importance/relevance of each voxel contributing to the final classification outcome. In contrast to susceptibility maps produced by guided backpropagation (âWhich change in voxels would change the outcome most?â), the LRP method is able to directly highlight positive contributions to the network classification in the input space. In particular, we show that (1) the LRP method is very specific for individuals (âWhy does this person have AD?â) with high inter-patient variability, (2) there is very little relevance for AD in healthy controls and (3) areas that exhibit a lot of relevance correlate well with what is known from literature. To quantify the latter, we compute size-corrected metrics of the summed relevance per brain area, e.g., relevance density or relevance gain. Although these metrics produce very individual âfingerprintsâ of relevance patterns for AD patients, a lot of importance is put on areas in the temporal lobe including the hippocampus. After discussing several limitations such as sensitivity toward the underlying model and computation parameters, we conclude that LRP might have a high potential to assist clinicians in explaining neural network decisions for diagnosing AD (and potentially other diseases) based on structural MRI data
COVID-19: a simple statistical model for predicting intensive care unit load in exponential phases of the disease
One major bottleneck in the ongoing COVID-19 pandemic is the limited number of critical care beds. Due to the dynamic development of infections and the time lag between when patients are infected and when a proportion of them enters an intensive care unit (ICU), the need for future intensive care can easily be underestimated. To infer future ICU load from reported infections, we suggest a simple statistical model that (1) accounts for time lags and (2) allows for making predictions depending on different future growth of infections. We have evaluated our model for three heavily affected regions in Europe, namely Berlin (Germany), Lombardy (Italy), and Madrid (Spain). Before extensive containment measures made an impact, we first estimate the region-specific model parameters, namely ICU rate, time lag between infection, and ICU admission as well as length of stay in ICU. Whereas for Berlin, an ICU rate of 6%, a time lag of 6 days, and a stay of 12 days in ICU provide the best fit of the data, for Lombardy and Madrid the ICU rate was higher (18% and 15%) and the time lag (0 and 3 days) and the stay in ICU (3 and 8 days) shorter. The region-specific models are then used to predict future ICU load assuming either a continued exponential phase with varying growth rates (0-15%) or linear growth. By keeping the growth rates flexible, this model allows for taking into account the potential effect of diverse containment measures. Thus, the model can help to predict a potential exceedance of ICU capacity depending on future growth. A sensitivity analysis for an extended time period shows that the proposed model is particularly useful for exponential phases of the disease
Promises and pitfalls of deep neural networks in neuroimaging-based psychiatric research
By promising more accurate diagnostics and individual treatment
recommendations, deep neural networks and in particular convolutional neural
networks have advanced to a powerful tool in medical imaging. Here, we first
give an introduction into methodological key concepts and resulting
methodological promises including representation and transfer learning, as well
as modelling domain-specific priors. After reviewing recent applications within
neuroimaging-based psychiatric research, such as the diagnosis of psychiatric
diseases, delineation of disease subtypes, normative modeling, and the
development of neuroimaging biomarkers, we discuss current challenges. This
includes for example the difficulty of training models on small, heterogeneous
and biased data sets, the lack of validity of clinical labels, algorithmic
bias, and the influence of confounding variables
Harnessing spatial homogeneity of neuroimaging data: patch individual filter layers for CNNs
Neuroimaging data, e.g. obtained from magnetic resonance imaging (MRI), is
comparably homogeneous due to (1) the uniform structure of the brain and (2)
additional efforts to spatially normalize the data to a standard template using
linear and non-linear transformations. Convolutional neural networks (CNNs), in
contrast, have been specifically designed for highly heterogeneous data, such
as natural images, by sliding convolutional filters over different positions in
an image. Here, we suggest a new CNN architecture that combines the idea of
hierarchical abstraction in neural networks with a prior on the spatial
homogeneity of neuroimaging data: Whereas early layers are trained globally
using standard convolutional layers, we introduce for higher, more abstract
layers patch individual filters (PIF). By learning filters in individual image
regions (patches) without sharing weights, PIF layers can learn abstract
features faster and with fewer samples. We thoroughly evaluated PIF layers for
three different tasks and data sets, namely sex classification on UK Biobank
data, Alzheimer's disease detection on ADNI data and multiple sclerosis
detection on private hospital data. We demonstrate that CNNs using PIF layers
result in higher accuracies, especially in low sample size settings, and need
fewer training epochs for convergence. To the best of our knowledge, this is
the first study which introduces a prior on brain MRI for CNN learning
Uncovering convolutional neural network decisions for diagnosing multiple sclerosis on conventional MRI using layer-wise relevance propagation
Machine learning-based imaging diagnostics has recently reached or even
superseded the level of clinical experts in several clinical domains. However,
classification decisions of a trained machine learning system are typically
non-transparent, a major hindrance for clinical integration, error tracking or
knowledge discovery. In this study, we present a transparent deep learning
framework relying on convolutional neural networks (CNNs) and layer-wise
relevance propagation (LRP) for diagnosing multiple sclerosis (MS). MS is
commonly diagnosed utilizing a combination of clinical presentation and
conventional magnetic resonance imaging (MRI), specifically the occurrence and
presentation of white matter lesions in T2-weighted images. We hypothesized
that using LRP in a naive predictive model would enable us to uncover relevant
image features that a trained CNN uses for decision-making. Since imaging
markers in MS are well-established this would enable us to validate the
respective CNN model. First, we pre-trained a CNN on MRI data from the
Alzheimer's Disease Neuroimaging Initiative (n = 921), afterwards specializing
the CNN to discriminate between MS patients and healthy controls (n = 147).
Using LRP, we then produced a heatmap for each subject in the holdout set
depicting the voxel-wise relevance for a particular classification decision.
The resulting CNN model resulted in a balanced accuracy of 87.04% and an area
under the curve of 96.08% in a receiver operating characteristic curve. The
subsequent LRP visualization revealed that the CNN model focuses indeed on
individual lesions, but also incorporates additional information such as lesion
location, non-lesional white matter or gray matter areas such as the thalamus,
which are established conventional and advanced MRI markers in MS. We conclude
that LRP and the proposed framework have the capability to make diagnostic
decisions of..
Nucleus basalis of Meynert predicts cognition after deep brain stimulation in Parkinson's disease
INTRODUCTION
Subthalamic DBS in Parkinson's disease has been associated with cognitive decline in few cases. Volume reduction of the nucleus basalis of Meynert (NBM) seems to precede cognitive impairment in Parkinson's disease. In this retrospective study, we evaluated NBM volume as a predictor of cognitive outcome 1 year after subthalamic DBS.
METHODS
NBM volumes were calculated from preoperative MRIs using voxel-based morphometry. Cognitive outcome was defined as the relative change of MMSE or DemTect scores from pre-to 1 year postoperatively. A multiple linear regression analysis adjusted for the number of cognitive domains affected in the preoperative neuropsychological testing and UPDRS III was conducted. To account for other variables and potential non-linear effects, an additional machine learning analysis using random forests was applied.
RESULTS
55 patients with Parkinson's disease (39 male, age 61.4 ± 7.5 years, disease duration 10.8 ± 4.7 years) who received bilateral subthalamic DBS electrodes at our center were included. Although overall cognition did not change significantly, individual change in cognitive abilities was variable. Cognitive outcome could be predicted based on NBM size (B = 208.98, p = 0.022*) in the regression model (F(3,49) = 2.869; R2 of 0.149; p = 0.046*). Using random forests with more variables, cognitive outcome could also be predicted (average root mean squared error between predicted and true cognitive change 11.28 ± 9.51, p = 0.039*). Also in this model, NBM volume was the most predictive variable.
CONCLUSION
NBM volume can be used as a simple non-invasive predictor for cognitive outcome after DBS in Parkinson's disease, especially when combined with other clinical parameters that are prognostically relevant
Altered Coupling of Psychological Relaxation and Regional Volume of Brain Reward Areas in Multiple Sclerosis
Background:Psychological stress can influence the severity of multiple sclerosis (MS), but little is known about neurobiological factors potentially counteracting these effects. Objective:To identify gray matter (GM) brain regions related to relaxation after stress exposure in persons with MS (PwMS). Methods:36 PwMS and 21 healthy controls (HCs) reported their feeling of relaxation during a mild stress task. These markers were related to regional GM volumes, heart rate, and depressive symptoms. Results:Relaxation was differentially linked to heart rate in both groups (t= 2.20,p= 0.017), i.e., both markers were only related in HCs. Relaxation was positively linked to depressive symptoms across all participants (t= 1.99,p= 0.045) although this link differed weakly between groups (t= 1.62,p= 0.108). Primarily, the volume in medial temporal gyrus was negatively linked to relaxation in PwMS (t= -5.55, p(family-wise-error(FWE)corrected)= 0.018). A group-specific coupling of relaxation and GM volume was found in ventromedial prefrontal cortex (VMPFC) (t= -4.89, p(FWE)= 0.039). Conclusion:PwMS appear unable to integrate peripheral stress signals into their perception of relaxation. Together with the group-specific coupling of relaxation and VMPFC volume, a key area of the brain reward system for valuation of affectively relevant stimuli, this finding suggests a clinically relevant misinterpretation of stress-related affective stimuli in MS
Large scale multifactorial likelihood quantitative analysis of BRCA1 and BRCA2 variants: An ENIGMA resource to support clinical variant classification
The multifactorial likelihood analysis method has demonstrated utility for quantitative assessment of variant pathogenicity for multiple cancer syndrome genes. Independent data types currently incorporated in the model for assessing BRCA1 and BRCA2 variants include clinically calibrated prior probability of pathogenicity based on variant location and bioinformatic prediction of variant effect, co-segregation, family cancer history profile, co-occurrence with a pathogenic variant in the same gene, breast tumor pathology, and case-control information. Research and clinical data for multifactorial likelihood analysis were collated for 1,395 BRCA1/2 predominantly intronic and missense variants, enabling classification based on posterior probability of pathogenicity for 734 variants: 447 variants were classified as (likely) benign, and 94 as (likely) pathogenic; and 248 classifications were new or considerably altered relative to ClinVar submissions. Classifications were compared with information not yet included in the likelihood model, and evidence strengths aligned to those recommended for ACMG/AMP classification codes. Altered mRNA splicing or function relative to known nonpathogenic variant controls were moderately to strongly predictive of variant pathogenicity. Variant absence in population datasets provided supporting evidence for variant pathogenicity. These findings have direct relevance for BRCA1 and BRCA2 variant evaluation, and justify the need for gene-specific calibration of evidence types used for variant classification
- âŠ